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Abstract 



By means of the notion of umbrae indexed by multisets, a general method to express estimators 
and their products in terms of power sums is derived. A connection between the notion of multiset 
£^ ■ and integer partition leads immediately to a way to speed up the procedures. Comparisons of com- 

putational times with known procedures show how this approach turns out to be more efficient in 
eliminating much unnecessary computation. 
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d • 1 Introduction 

t— > 

It is acknowledged that an appropriate choice of language and notation can simplify and clarify many 
statistical calculations. In recent years, most of the work has been done by symbolic computation, main 
■ references are |14j . [3J. These books offer a variety of applications of symbolic methods, from asymptotic 

expansions to the Edgeworth series, from likelihood functions to the saddlepoint approximations. In [25j . 
Zeilberger describes a methodology for using computer algebra systems to automatically derive moments, 
, up to order 4, of interesting combinatorial random variables. Such a methodology is applied to pattern 

statistics of permutations. For different applications of computer algebra in statistics, see also [T5] . 

OO 

| The aim of this paper is to show the computational efficiency of umbral calculus in manipulating 

expressions involving random variables. 

The umbral calculus to which we refer is the version featured by Rota and Taylor in [20] . The basic 
device is the representation of a unital sequence of numbers by a symbol a, called an umbra, that is, 



the sequence l,ai,a2, ... is represented by the sequence I, a, a 2 , ... of powers of a via an operator E 
resembling the expectation operator of random variables. This approach has led to a finely adapted 
language for random variables by Di Nardo and Senato, see [7|- In [8], attention is focused on cumulants 
since a random variable is often better described by its cumulants than by its moments, as it happens 
for the family of Poisson random variables. Moreover, due to the properties of additivity and invariance 
under translation, cumulants arc not necessarily connected with moments of any probability distribution. 
As a matter of fact, an umbra seems to have the structure of a random variable but with no reference 
to a probability space, bringing us closer to statistical methods. In [5], it is shown that classical umbral 
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calculus provides a unifying framework for unbiased estimators of cumulants, called /c-statistics, and their 
multivariate generalizations. Moreover, within the umbral framework, a statistical result does not require 
a check of background details by hand, but becomes a corollary of a more general theorem. 

Here, we focus attention on the more general problem of calculations of algebraic expressions such as 
the variance of a sample mean or, more generally, moments of sampling distributions, which have a variety 
of applications within statistical inference [IS]. The field has, in the past, been marred by the difficulty 
of manual computations. Symbolic computations have removed many of such difficulties, leaving some 
issues unresolved. One of the most intriguing questions is to explain why symbolic procedures, which are 
straightforward in the multivariate case, turn out to be obscure in the simpler univariate one, see [2j. 

We show that the notion of multiset is the key for dealing with symbolic computation in multivariate 
statistics. Actually, at the root of the question, there are some aspects of the combinatorics of symmetric 
functions that would benefit if the attention is shifted from sets to the more general notion of multiset. On 
the other hand, due to its generality, umbral calculus reduces the combinatorics of symmetric functions, 
commonly used by statisticians, to few relations which cover a great variety of calculations. In particular 
umbral equivalences (|21[) smooth the way to handle any kind of product of sums. As example, by using 
equivalences ()21[) . we evaluate the mean of product of augmented polynomials in separately independent 
and identically distributed random variables. The resulting umbral strategy is completely different from 
those recently proposed in the literature, see for instance [24| . and computationally more efficient, as we 
show in the last section. 

Moreover, by means of the notion of umbrae indexed by multiset, we remove the necessity to specify if 
the random variables of a vector are identically distributed or not. The basic procedure consists in finding 
multiset subdivisions, which suitably extends the notion of set partitions. The strategy here proposed 
is a sort of iterated inclusion-exclusion rule [TJ [5] , whose efficiency is improved taking into account the 
structure of multiset and its relation with integer partitions. The result is the algorithm makeTab, given 
in the appendix. 

The paper is structured as follows. Section 2 is provided for readers unaware of classical umbral 
calculus. We resume terminology, notation and some basic definitions. In Section 3, we give a general 
procedure for writing down [/-statistics. Recall that many statistics of interest may be exactly represented 
or approximated by U -statistics . Such a procedure is based on the umbral relation between moments 
and augmented symmetric functions. The connection with power sums is analyzed in Section 4. The 
effectiveness of umbral methods is shown in several examples, proposed with the intention of helping the 
reader unaware of umbral calculus to understand the basic algebraic rules necessary to work with this 
syntax. Section 5 is devoted to umbral formulae giving power sums in terms of augmented symmetric 
functions. These formulae provide the most natural way to form the product of augmented symmetric 
functions by using a suitable umbral substitution. The consequent reduction of the computational time is 
made clear through some examples, which point out the role played by the singleton umbra in selecting 
the suitable variables. Section 6 is devoted to computational comparisons with the procedures known in 
the literature in dealing with moments of sampling distributions. The speed up of umbral methods is 
evident. Some concluding remarks end the paper. 

Although, by a numerical point of view, MAPLE seems to work less efficiently respect to MATHEMATICA, 
we have implemented our algorithms in MAPLE because the syntax is more comfortable for symbolic com- 
putation. All tasks have been performed on a PC Pentium(R)4 Intel(R), CPU 3.00 Ghz, 480MB Ram 
with MAPLE version 10.0 and MATHEMATICA version 4.2. 
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2 The classical umbral calculus 

Classical umbral calculus is a syntax consisting of the following data: 

i) a set A = {a, j3, . . .}, called the alphabet, whose elements are named umbrae; 
ii) a commutative integral domain R whose quotient field is of characteristic zero; 

Hi) a linear functional E, called an evaluation, defined on the polynomial ring R[A] and taking values 
in R such that 

a) E[l] = 1; 

b) E[a i P J ' ■ ■ • 7 fe ] = E[a l ]E[(3^] ■ ■ ■ E[-^ k ] for any set of distinct umbrae in A and for i,j,...,k 
non-negative integers [uncorrelation property); 

iv) an element e £ A, called an augmentation, such that E[e n ] = for all n > 1; 

v) an element u £ A, called a unity umbra, such that = 1 for all n > 1. 

Note that, for statistical applications, R is the field of real numbers. 

An umbral polynomial is a polynomial p £ i?L4]. The support of p is the set of all umbrae occurring in 

P- 

If p and q are two umbral polynomials, then 

i) p and 5 are uncorrelated if and only if their supports are disjoint; 

ii) p and q are umbrally equivalent iff i?[p] = E[q], in symbols p ~ q. 

The basic idea of the classical umbral calculus is to associate a sequence of numbers 1,02,03, .. . to an 
indeterminate a, which is said to represent the sequence. This device is familiar in statistics, when 
represents the i-th moment of a random variable X. In this case, the sequence 1, 01,02, ■ ■ ■ results from 
applying the expectation operator E to the sequence 1,X, X 2 , . . . consisting of powers of X. This is why 
the elements a n £ R such that 

E[a n ] =a n , n>0 

are named moments of the umbra a and we say that the umbra a represents the sequence of moments 
1, a\, a%, . . . . The umbra e plays the same role of a random variable which takes the value with probability 
1 and the umbra u plays the same role of a random variable which takes the value 1 with probability 1. 
The uncorrelation property among umbrae parallels the analogue one for random variables. In this setting 
no attention must be paid to the well-known "moment problem" . 

In parallel with random variable theory, the factorial moments of an umbra a are the elements at n ) £ R 
corresponding to the umbral polynomials (a) n = a{a — 1) • • • (a — n + 1), n > 1, via the evaluation E, 
that is E[(a)„] = dr n y 

There are umbrae playing a special role in the umbral calculus. Their properties have been investigated 
with full particulars in [S] . 

Singleton umbra. The singleton umbra x is the umbra whose moments are all zero, except 
the first E[x] = 1. Its factorial moments are X( n \ = ( — l) n_1 (n — 1)! As we will see later on, 
this umbra is the keystone for managing symmetric umbral polynomials. 
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Bell umbra. The Bell umbra (3 is the umbra whose factorial moments are all equal to 1, that 
is E[(f3) n ] = 1 for all n > 1. Its moments are the Bell numbers, that is the number of partitions 
of a finite nonempty set with n elements, or the n-th coefficient in the Taylor series expansion 
of the function exp(e* — 1). So /3 is the umbral counterpart of a Poisson random variable with 
parameter 1. 

It is possible that two distinct umbrae represent the same sequence of moments, in such case these are 
called similar umbrae. More formally two umbrae a and 7 are said to be similar when 

E[a n ] = E[<y n ] Vn > 0, in symbols a = 7. 

Furthermore, given a sequence 1, 01,02,... in R, there are infinitely many distinct, and thus similar 
umbrae representing the sequence. So, the umbral counterpart of a univariate random sample is a n- 
vector (ai, a%, . . . , ct n ), where a.i,i = l,2,...,n are uncorrclatcd umbrae, similar to the same umbra 
a. Thanks to the notion of similar umbrae, it is possible to extend the alphabet A with the so-called 
auxiliary umbrae resulting from operations among similar umbrae. This leads to construct a saturated 
umbral calculus in which auxiliary umbrae are handled as elements of the alphabet |20j . In the following, 
we focus the attention on auxiliary umbrae which play a special role. Let {a.\,a2 1 ■ ■ ■ ,a n } be a set of 
n uncorrelated umbrae similar to an umbra a. The symbol n.a denotes an auxiliary umbra similar to 
the sum ax + a% + • • • + a n . So n.a is the umbral counterpart of a sum of independent and identically 
distributed random variables. The symbol or" is an auxiliary umbra denoting the product ot\ ai ■ ■ ■ a n . 

Moments of or" can be easily recovered from its definition. Indeed, if the umbra a represents the 
sequence 1, ax, 0,2, ■ ■ ■ , then E[(cf n ) k ] = a£ for nonncgative integers k and n. 

Moments of n.a can be expressed through integer partitions. Recall that a partition of an integer i is 
a sequence A = (Ai, A2, . . . , A t ), where Xj are weakly decreasing integers and Y^,j=x -\? = The integers Aj 
are named parts of A. The length of A is the number of its parts and will be indicated by v\. A different 
notation is A = (l ri , 2 r2 , . . .), where rj is the number of parts of A equal to j and rx + r% + • • • = v\. For 
example, (l 3 , 2 1 , 3 2 ) is a partition of the integer 11. We use the classical notation A h i to denote that "A 
is a partition of i" . By using an umbral version of the well-known multinomial expansion theorem [S] , we 
have 

(n.a) 1 ~ *^2(n)„ x d\a x , (1) 

Xhi 



where the sum is over all partitions A = (l ri , 2 r2 , . . .) of the integer i, (n) Ux = for v\ > n, 

i\ 1 



and a x = (a n )^(aj 2 )^ ■ ■ ■ , (2) 



ri!r 2 !--- (l!) ri (2!) r 2 •■■ * ~ V Jiy V j2 

with {ji} distinct integers chosen in {1, 2, . . . , n} = [n]. 

The reader interested in proofs of identities involving auxiliary umbrae is referred to [7]. 

A feature of the classical umbral calculus is the construction of new auxiliary umbrae by suitable 
symbolic substitutions. For example, in n.a replace the integer n by an umbra 7. From (JTJ) , the new 
auxiliary umbra "f.a has moments 

( 7 .ar * 5>kdA«A (3) 

AH 

and it is called dot-product of 7 and a. The auxiliary umbra j.a is the umbral counterpart of a random sum. 
In the following, we recall some useful dot-products of umbrae, whose properties have been investigated 
with full particulars in [5]. 



5 



a-factorial umbra. The umbra a.\ is called the a-factorial umbra. Its moments are the 
factorial moments of a, that is (a.x) 1 — (o;)i- If a = x, then E[(x-xY] = ^[(x)i] = x (i) = 
(-l)-i(i-l)!. 

a-cumulant umbra. The umbra x-Q^ with x the singleton umbra, is called the a-cumulant 
umbra. By virtue of ©, its moments are 

AH AH 

Since the second equivalence in Q recalls the well-known expression of cumulants in terms of 
moments of a random variable, it is straightforward to refer the moments of the a-cumulant 
umbra x-ct as cumulants of the umbra a. 

3 [/-statistics 

In the following, we focus our attention on two kinds of auxiliary umbrae: n.a and n.(xa). Such umbrae, 
and their products, are similar to some well-known symmetric polynomials. Indeed, by definition we have 

n.a r = a\ + ■ ■ ■ + a£, 

where ai,a%, . . . ,a n are uncorrelated umbrae, similar to the umbra a. Since the umbrae at for i = 
1,2, ... ,n can be rearranged without effecting the evaluation E, the auxiliary umbra n.a r is similar to 
the r-th power sum symmetric polynomial in the indeterminates a\, a%, . . . , a„. 
Moreover, since 

n.{X<*) = Xl<*l H h Xn^n, 

powers of n.(x a ) are umbrally equivalent to the umbral elementary symmetric polynomials [?i.(xa)] fc — 
k\e k (ai, a 2 , . . . , a n ), where 

efc(ai, a 2 , . . . , a„) = ^ a n a j2 ■ ■ ■ a jk , k = 1, 2, . . . , n. 

l<ji<h<—<jk<n 

The proof is given in [5] and it relies on the role played by the umbra x in picking out the indeterminates. 
Example 3.1 If k = 2, then 

n 

[n.(xa)] 2 ~ (xiaiH h x«««) 2 - ^{Xtad 2 + 2 Xh<XjiXj 2 <Xh 

i=l l<ji<j2<n 

l<ii<j2<n 

The last equivalence follows by observing that (xiCti) 2 — for i = 1,2, ... ,n since E[(xiai) 2 ] — E[xf]E[af] 
for the uncorrelation property between ai and Xii an( i E[xf]E[af } = because £?[xf] = 0. On the other 
hand, we have Xji a ji Xh a j2 — a ji a j2 f° r 1 ^ .h < ]2 < n since the uncorrelation property among 
Xji,a h ,Xh,oij 2 implies E[x h a h Xj 2 ot h ] = E[xjAE[^h] E [Xh\ E [ a h^ but ^taJ^Kil^Ix^Ka] = 
E[a n ]E[a j2 ] because E[ Xjl ] = E[ Xj2 ] = 1 □. 

The auxiliary umbra n.(xa) enables us to rewrite umbral augmented symmetric polynomials in a very 
compact expression. Let A = (l ri , 2 r2 , . . .) be a partition of the integer i < n. Augmented monomial 



symmetric polynomials in the indeterminates ai, ai, • • • , a n are denned as 



G 



2 2 

>1 U >1+1 ' " a j'r 1+ r 2 



In statistical literature, a more common notation is [l Vl 2 T2 . . .] [13]. For instance, [1 2 3] denotes 

a h a h a j 3 ■ 

If A = (l ri ,2 r2 ,.. .) h i, then 

[n.(xa)P[n.( X a 2 )p---^[l r ^-...}, (5) 

taking into account the role played by the umbra x m selecting variables. We point out that the umbral 
notation is very similar to the notation [l ri 2 r2 . ..]. As before, we give an example in order to clarify 
equivalence ([5]). 

Example 3.2 If A = (1 2 ,3), then 

[n.(xa)] 2 [n.(xa 3 )) ~ (xi«i H h x„a„) 2 (xi«? H h 

^l<jl#i2<« / \l<J3<n / I<jl#i2#j3<n 

The last equivalence follows by observing that E[xj 1 Xj 2 Xj3\ vanishes where there is at least one pair of 
equal indexes □. 

In the following theorem, we give the umbral formulation of a fundamental expectation result in statistics, 
see |23) . This is a deep result because it lies at the core of unbiased estimation and moments of moments 
literature. 

Theorem 3.1 If X — (IT 1 , 2 1 " 2 , . . .) is a partition of the integer i < n, then 

a x ^-^—[n.{xu)] ri [n.{xo?)Y 2 ---. (6) 

See [9] for the proof. 

Equivalence © states how to estimate products of moments a\ by means of only n bits of information 
drawn from the population. In umbral terms, the population is represented by a and the n bits of 
information are the uncorrelated umbrae a\,a2, ■ ■ ■ ,a n , coming into [n.(xo: l )] ri . Moreover, having shown 
that the umbral polynomials [n.(xa*)] ri are similar to elementary polynomials, Theorem 13.11 discloses 
a more general result: products of moments are umbrally equivalent to products of umbral elementary 
polynomials. The symmetric polynomial on the right side of equivalence © is named U-statistic of 
uncorrelated and similar umbrae ot\, a-i-, ■ ■ ■ , ot n . We take a moment to motivate this denomination. Usually 
an [/-statistic has the form 

where X\, X%, . . . , X n are n independent random variables, and the sum ranges in the set of all permuta- 
tions {31,32, ■ ■ ■ 1 jk) of k integers with 1 < j\ < n. If Xi, X%, . . . , X n have the same cumulative distribution 
function F(x), U is an unbiased estimator of the population parameter 

9(F) = [■■■ [ &(x 1 ,...,x k )dF(xi)--'dF(x k ). 



In this case, the function $ may be assumed to be a symmetric function of its arguments. Often, in 
the applications, $ is a polynomial in X^s so that the {/-statistic is a symmetric polynomial. Hence, by 
virtue of the fundamental theorem on symmetric polynomials, such an [/-statistic can be expressed as a 
polynomial in elementary symmetric polynomials. 

Example 3.3 Moment powers. Let us consider the partition A = (l 2 ) of the integer 2. The symmetric 
polynomial 

[/ = -i-[n.( X a)] 2 , n>2, 
[n)2 

is the [/-statistic related to or 2 ~ a\. Indeed, setting n = 2 and v\ = 2 in ©, we have 



ff2 -zrE a ^ - 



where the last equivalence follows by expanding the square of n.{xot). □ 

Example 3.4 k- statistics. The i-th fc-statistic fcj is the unique symmetric unbiased estimator of the 
cumulant Ki of a given statistical distribution, that is E[ki] = Ki |23| . In umbral terms, we have 

( X . a y ^Et^T- d ^ n -(xa)r Mx« 2 )P • ■ • , (7) 

AH [n >^ 

by using equivalence (H| and Theorem [33] Equivalence {7} is the umbral version of the i-th fc-statistic fc^. 
□ 

Usually fc-statistics arc expressed in terms of power sums in the data points, S r = X)™=i '■ Umbrally, 
this is equivalent to expressing fc-statistics in terms of n.a r , that is to expressing products of auxiliary 
umbrae such as [n.(xct 1 )] r ' in terms of n.a 3 , for some j. Next section is devoted to exploring such relations, 
which also allow us to express moments of sampling distributions in terms of population moments. 

4 Augmented and power sums symmetric functions 

In this section we turn our attention to symmetric functions useful in computing moments of sampling dis- 
tributions, i.e augmented monomial symmetric functions and power sums, with special care in formula con- 
verting the former in terms of the latter and viceversa. Such polynomials are classical bases of the algebra of 
symmetric polynomials. The well-known changes of bases involve the lattice of partitions, see [21| . Several 
packages are available aiming to implement changes of bases (see http://garsia.math.yorku.ca/MPWP/). 
For instance, the SF package [H] is an integrated MAPLE package devoted to symmetric functions. The use 
of such packages requires a good knowledge of symmetric function theory and is not so obvious. Moreover, 
due to their generality, such packages are slow when applied to large variable sets. 

The connection between augmented symmetric functions and power sums has been given in umbral 
terms, this because umbral notation simplifies the changes of bases, taking advantage of multiset notion. 
In the following we summarize the steps necessary to construct such formulae in the most general case, 
which have applications in multivariate statistics. The reader interested in proofs is referred to [S]. 

The starting point is the expression of moments of n.(\ a ) m terms of n.a and viceversa: 



( X a)} 1 ~ ^d A ( X .x)A(n.«r(n.a 2 p..., (8) 

Ahi 

(n.aY ~ ^d A [n.( X a)]'>.( X a 2 )p.... (9) 



Ah, 



8 



Such equivalences involve integer partitions and are very easy to implement since there is at least one 
procedure devoted to integer partitions in any symbolic package. Note that equivalences (|HJ) and (0 may 
be rewritten replacing a with any power a k . For instance, in (0) we have 

(n.a k Y ~ Yl dx{n.( X a k )P [n.( X a 2k )P ■ • • . 

Ahi 

The next step is to express more general products [n.(xa)] ri [n.(xa 2 )] r2 ■ ■ ■ (that is augmented symmetric 
polynomials) in terms of power sums. With this aim, equivalences (j8J) and j9j must be rewritten by using 
set partitions instead of integer partitions. We say in advance that the final step will consist in replacing 
the set with the more general structure of multiset. 

Let C be a subset of R[A] with n elements. Recall that a partition it of C is a collection it = 
{Bi, £?2, • • • , -Bfc} with k < n disjoint and not-empty subsets of C whose union is C. We denote by n„ the 
set of all partitions of C. Let {ai, a.%, . . . , a n } be a set of n uncorrelated umbrae similar to an umbra a. 
The symbol cr* denotes the umbra 

.7T _ l-Bil \B 2 \ \B k \ / 1 

a =< < i ( 10 ) 

where 7r = i?2, ■ ■ • , B^} is a partition of {oa, Q2j • ■ ■ , ct n } and ii, i*2, . . . , «fc are distinct integers chosen 
in {1, 2, . . . , n}. Note that a ,7r = a\, when A is the partition of the integer n determined by tt. Indeed, 
a set partition is said to be of type A = (l ri ,2 r2 , . . .) if there are n blocks of cardinality 1, r2 blocks 
of cardinality 2 and so on. The number of set partitions of type A is d\, as given in ((2|). By using set 
partitions, equivalences jSj) and §§§ may be rewritten as 



in 



(n.aY ~ ^[n.( X «)] ri K(x« 2 )] r2 ---, (H) 

( X «)r ~ ^(x.x)- 7r (^r(^-« 2 r---, (12) 



Treri; 



where ^[(x-x)^] = ^ [(x-x) |Bl| (x'-x') |B21 ' ' ' = Ui=i x \B t \ from dTUJ> . From a computational 

point of view, equivalences (fTTj) and (j 1 2|) arc less efficient than equivalences <JSj> and (|9|). The computational 
cost is 0(B n ), where B n is the n-th Bell number whose growth is greater than e™. Anyway, equivalences 
(fTT|) and lO smooth the way to generalize such computations to the multivariate case, by using the 
notion of multiset. 

A multiset M is a pair (M, /), where M C is a set, called the support of the multiset, and / is 
a function from M to the non-negative integers. For each \i £ M, f(/i) is called the multiplicity of fi. If 
the support of M is a finite set, say M = {fj,i, /X2, . . . , /Afe}, we write 

M = {$^\$M>\..., or M={m 1i __mx j . 

/(Ml) 

The length of the multiset M is the sum of multiplicities of all elements of M, that is 

\M\ = J2 /(/*)■ 

From now on, we denote a multiset (M, /) simply by M. For instance the multiset 

M = {a,..., a} = {a {i) } 




has length i, support M = {a} and /(a) = i. In the following, we set 

lHi=H^ M and (n./i) M = ]J(n.fi)^. (13) 

For instance, if M = {cr^} then (n.a)M — (n.a)' 1 and [n.(xa)]Af — [^-(x^)] 1 - Note that this notation can 
be easily extended to umbral polynomials. 

If A = , 2 r2 , . . .) is an integer partition, set 

P\ = {a,...,a, 5t 2 ,...,a 2 ,...}. (14) 

ri r 2 

By using the notation (TT^J), we have (n.a)p x ~ (n.a) ri (n.a 2 ) r2 ■ ■ ■ and [n.{x a )]p x — [ n -(x a )] ri [ n -(x a2 )Y 2 ' ' ' ; 
so equivalences (jTTJ) and (|12|1 may be more compressed 

Hi/~^[n.(xa)] Pj and [n.(xa)] M - (X-X)' 7 ' (n.a) Px , (15) 
Trerii Trent 

where A is the type of the set partition tt. 

In equivalences (|15[) . we have M = {a^'}. The last step consists in generalizing such equivalences to 
any multiset M. To this aim, we recall the notion of multiset subdivision. Such a notion is quite natural 
and it is equivalent to splitting the multiset into disjoint blocks (submultisets) whose union gives the whole 
multiset. 

A subdivision of a multiset M is a multiset S = (S, g) of k < |M| non-empty submultisets Mj = (Mi, /;) 
of M such that 

i) uf =1 Mi = M; 

u ) Ei=i fi(^) = /(/■*) for an y M € M. 

Recall that a multiset Mi = {Mi, fi) is a submultiset of M = (M , /) if M, C M and < /(/i), V/i £ Mj. 
If A/ = {a^}, then subdivisions arc of type 




with r\ + 2 T2 + ■ ■ ■ = i, and we will say that the subdivision S is of type A = (l ri , 2 T2 , . . .) h i. The support 
of S is S = {{a},{aW},...}. 
If 

S = {Mi . . . , Mi, M 2 , . . . , Af 2 , . . . , M k , . . . , M fc }, (16) 

g(Afi) »(Ma) s(M fe ) 

we set 

M5=I1# ) and (n. M ) s = [] (n. m ) fl(A/i) , (17) 

AU£S AI t ES 

extending the notation (fl~3|) . 

When integer partitions are replaced by multiset subdivisions, the fundamental expectation result © 
becomes 

[n.(xi*)]s - N|5|/i' S , (18) 

with S given in (fTI))) and 

M- S = (mm 1 )^ Mi) ---(m'mJ- s(M ' s) . (19) 
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where fiM t are uncorrelated umbral monomials. 

By using the notation (fTT|) and recalling (fT4"|) . we have (n.a)p x = {n.a)s and [n.(xa)]p x = [n.(xa)]s, 
with A = (l ri ,2 r2 , . . .) hi and 5 the subdivision of type A. Then equivalences ([T5j) may be written as 
follows 

(n.a) M ~ 2J [«-(xa)]s and [n.(xa)] M ~ ^ (x.x)' ?r ("-")s- (20) 
7reni Trerii 

One more remark allows us to remove integer partitions from (|20[) which is necessary when the multiset 
M = {a^} is replaced by an arbitrary multiset. Let us observe that a subdivision of the multiset M may 
be constructed in the following way: suppose the elements of M to be all distinct, build a set partition 
and then replace each element in any block by the original one. In this way, any subdivision corresponds 
to a set partition n and we will write S„. Note that it is \S n \ = \tt\ and it could be S ni = S^ 2 for tt\ =/= 112, 
as the following example shows. 

Example 4.1 If M = {a, a, 7, 5, S}, label each element of M in order to have the set C = {a.\, a 2 , 71, Si, 62}- 
The subdivision Si = {{a, 7}, {a}, {S, S}} corresponds to the partition 7Ti = {{ai, 71}, {^2}, {Si, S2}} of C. 
Itis|5i| = 7r L I . Note that the subdivision Si also corresponds to the partition n 2 = {{«2, 7i}, {ai}, {Si, 62}}- 
□ 

Finally, equivalences (|15|) may be rewritten as follows 

(n.^) A/ ~ [n.(x/i)]s„ and [ti.(xh)]m ^ X! (x-xV^n-^sv, (21) 

where S„ is the subdivision of M corresponding to the partition it of a set C such that \C\ = \M\ = i. 
The symbolic expression of such equivalences does not change if the multiset M = {cr 1 '} is replaced by 
an arbitrary multiset. 

The following example shows the effectiveness of umbral notation in managing moments of sampling 
distributions. 

Example 4.2 If M = {^1,^1,^2} then (n.(J,)M — (n./zi) 2 (n./Z2). In statistical terminology, moments of 
(ti.h)m correspond to moments of the product of sums (X)"=i -^i) 2 E"=i ^») > where (Xi, Yi), . . . , (X n , Y n ) 
are separately independent and identically distributed random variables. In order to apply the first part 
of (|2Tj) . we need to compute all subdivisions of M. These are given in Table 1. 







[«-(xm)K 


{{m,m,n 2 }} 


1 


"■■(XMiA^) 


{{Mi},{Mi,M2}} 


2 


["•(XMi)][«-(XMiM2)] 


{{M2},{Ml>Ml}} 


1 


[«-(XM2)][n.(xMi)] 


{{M>{Mi}>{M2}} 


1 


[n.(xMi)] 2 [n.(x/^ 2 )] 



Table 1: Subdivisions of M = {/xi, yUi, fj, 2 }. 



The last column in Table 1 has been constructed by the following considerations. Suppose to consider 
the second row: Si = {{/^l}, {fJ-i, M2}} is a subdivision of M = {fii, fXi, fx 2 }. The support of Si consists 
of two multisets, Mi = {fJ.i} and M2 = {/Ui,/i2}, each of one with multiplicity 1 so that [n.(xn)]si — 
n.{xi*M 1 )n.(xtJ'M2)- Since n.(xMA/J = n.(xm) and n.(xfiM 2 ) = n.(xMiM2), we have [n.(x/")]si = ™-(XMi) ^(xMl/^O- 
Repeating the same arguments for all subdivisions of M, we get the results of Table 1. By using Table 1 
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and the first part of (f21j) . we have 

(n.iii) 2 (n.^ 2 ) ^ n.(xMiM2) + 2[n.(xMi)]K(xMiM2)] 



+ ["-(XM2)][n.(xMi)] + [n-(xm)} 2 [n-(xH2)}, 



that is 



vi=l 



^Tx^ + 2 ^ a\.y,v, 

E ^+ E A < v ^ 

l<i=£j<n l<i^j^k<n 



(22) 



In order to evaluate the mean of the sums, on the right hand side of (|22[) . in terms of population moments, 
we have to use (|T8|) and finally we have 



E 



= n£LX 2 Y~] + 2(ti) 2 £[I]E[IF] 
+ (n) 2 £LX 2 ]£[y] + (n) 3 S[X] 2 S[y] 



□ 



In the previous example, we have shown the usefulness of the first part of (|2ip in evaluating the mean of 
product of power sums. In the next example, we show the usefulness of the second part of (|2ip in order to 
express any [/-statistic in terms of power sums. Indeed, first we make use of equivalence (|18p . in order to 
translate products of moments - also multivariate - in terms of augmented symmetric polynomials. Then 
we apply the change of bases given by the second part of (f2Tj) . 

The following example shows how to construct multivariate fc-statistics. 

Example 4.3 Multivariate k-statistics. In umbral terms, a multivariate cumulant is the element of R 
corresponding to E[(x-^)m] = n>t y ...t r t where M = {/Xj* 1 ^, /Lt^, ■ ■ ■ is a multiset and (x-M)m is the 

symbol denoting the product (x-^iY 1 (X-^Y 2 • • • (x-t i rY r - Suppose \M\ = i. The extension of (HJ) to the 
multivariate case is 

(X-M)M^ E(^) M M' 5 % 

where 5V is the subdivision of the multiset M corresponding to the partition n £ 11;. By equivalence (|18j) . 
we write 

(X-M)m - E (x-x) M 7-s — [n-(xv)]s„, 



and by the second part of (|2ip we have the umbral version of multivariate fc-statistics in terms of power 
sums, that is 

(x-x) M 



M 



E 



E (x-x)' T {n-n)s T 



(23) 

where S T is the subdivision of M corresponding to the partition r of a set having the same cardinality of 

7T. □ 



5 Products of augmented polynomials 
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This section is devoted to a different application of equivalences ([2"Tjl . necessary to evaluate the mean of 
product of augmented polynomials in separately independent and identically distributed random variables. 

We borrow the following example from the paper of Vrbik |24) . 

Suppose, for instance, to need the mean of 

J2 X ? X U \12 X i Y i) = S '{{2,0},{l,0}}<S'{{2a}}S'{{2,l}}> (24) 

^ } ) Vi=i / 

where (Xi, Yi), . . . , (X n , Y n ) are separately independent and identically distributed random variables and 

by means of the notation introduced by Vrbik in [21] . 

If we expand the product (|24p as a linear combination of augmented symmetric polynomials (|25p . then 
we are able to apply the fundamental expectation result (fT8|) and to evaluate the mean of (f2"4|) . The 
umbral tools, we have introduced up to now, are sufficient to do such a work. Therefore, since 

\ 

^jT\X?Xj corresponds to [tt-(xMi) n -(XMi)]i an( l 

^^X 4 2 K t ^ corresponds to [ft-(XMiM2)]> 

the product (|24p is umbrally represented by 

n-(xi(4) n-(XiVi) n.(x2^lm) n.(x3Mi/"2), (26) 

where {xi}>i = 1,2,3 are uncorrclatcd singleton umbrae. Indeed, a product of uncorrelated singleton 
umbrae does not "delete" the same indexed umbrae. Moreover, the sum (|25p is umbrally represented by 



(27) 
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Ok 


•5{{fci,;i},{fc2,i2},-} 


llAlPlJj lAlPl/; iA2MlP2Jj T.X3M1P2J/ 


>5 , {{2,0},{1,0},{2,1},{2 1 1}} 


tlAlPl! AlPl/! \A2MiM2/, \A3PlM2// 


n 


lAXlMl; X2M1M2 j, lAlPl/) 1X3^1^2// 


5'{{4,1},{1,0},{2,1}} 


iiXlMl; X3A i lA i 2|, 1X2^1^2/, \XlMl // 


5'{{4,1},{1,0},{2,1}} 


llXlPl; A2/ i lA i 2|, tXlMl/i \X3AnA*2 // 


5'{{3,1} ) {2,0},{2,1}} 


ttXlPl; X3/ i lA i 2|, tXlPlJ) \X2A*iA i 2// 


5'{{3,1},{2,0},{2,1}} 


\\X2MlM2, X3MlM2/, IXlMl/: IXlMl/J 


5'{{4,2},{1,0},{2,0}} 


\\XlMl> XlMl/> 1X2^1^2, X3A i lA t 2// 


n 
u 


{{XiMi. X2M1M2}, IxiMi, X3WM2}} 


S{{4,1},{3,1}} 


{{XlMl, X3M?M2>, {X2M1M2, X1M1}} 


S{{4,1},{3,1}} 


{{Xi/"i,X2MiM2,XiMi}, {X3M1M2}} 





\lXiMi> X3M1M2, X1M1/? \X2M1M2jT 


n 


11X2^1 P2 , A3PlM2;Alf t lJ;lAl/ i l/J 


^{{5, 2}, {2,0}} 


{{X2/^M2, X3M?M2, XlMl}, {XlMl}} 


S{{6,2},{1,0}} 


{{XiMi, XiMi,X2MiM2, X3M1M2}} 






Table 2: Subdivisions of il/ = {xiMi, XiMi: X2M1M2, X3M]>2}- 



Let M = {xiMii XlMii X2A t iM2, X3A t iA t 2}- The length of M is 4, and since the monomials arc all different, 
M is a set. From the second part of (|T3|) . we have 

(n./i) M - ["-(xiMi)«-(xiA i i)][«-(X2MiA i 2)]["-(x3A i iM2)]- (28) 
Via the first part of equivalence (j2"Tj) . we have 

["•(XiMi)«-(XiA*i)][«-(X2AtiA*2)][n.(x3MiM2)] - ^ [n.(x/")]7r, (29) 

where SV = ir, since M is a set, 1I4 is the set of all partitions of M and [i is an element of M. 
We must pay attention to the auxiliary umbra [n.(x/i)]„-, whose structure looks like (|27|) . For exam- 
ple, let us consider the partition 7Ti = {{xiMi) X2M1M2}, {xiMii X3/ i i/ i 2}}- Observe that [n. (xA 4 )] tti = 
"-•(XiX2A i iM2) "-(xiX3MIA*2) ^ n.(x/"iM2) «-(XM?A*2), corresponding to 5{{ 4 ,i},{3,i}} in If we consider 
^2 = {{xiMijXi^i}, 1X2^1^2, X3A i iA i 2}} , we have [n.(x/^)]^ 2 = Mi) «-(XX2X3 MiMi)- We have 

n -(x X2X3 Mi/ 1 !) — n -(XMiM2) w ith the deletion of indexed singleton umbrae, but ^.(xXiMi) — 0; since 
~ 0. So we have [n.(x^)]7r 2 — 0. 

In conclusion, when in one - or more - blocks of the subdivision S n , there arc at least two umbral 
monomials involving correlated singleton umbrae, the auxiliary umbra [^•(x/ i )]s„ b as the evaluation equal 
to zero. If within every block of the subdivision there are only uncorrelated singleton umbrae, then 
[n.(xn)]s^ gives rise expressions like (f2"T|) . umbrally representing (|2"5|) . 

We do all computation in Table 2. We give the corresponding sum of independent random variables 
(|25p . instead of [n.(xfJ-)]s w ■ Subsection 6.1 is devoted to the algorithm makeTab, which allows us to construct 
multiset subdivisions similar to those in the first column of Table 1 and Table 2. 
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6 Computational results 

6.1 Find subdivisions of a multiset: the procedure makeTab 

Equivalences ([21]) have been implemented in MAPLE. These equivalences share the procedure makeTab 
necessary to construct multiset subdivisions. 

When the multiset is of type {a^}, an efficient way is to resort the partitions of the integer k, as 
equivalences (|15|) show. In general, as already stressed in Section 4, we may construct multiset subdivisions 
by using suitable set partitions, but this approach has a computational cost proportional to the n-th Bell 
number B n , so it is not efficient. Indeed, examples have shown how subdivisions may occur more than 
one time in the same formula (see Table 1), so that it is necessary to build a procedure generating only 
different subdivisions together with their multiplicity, that is the number of corresponding set partitions. 
To accomplish this task, the algorithm makeTab takes into account the connection between multisets and 
integer partitions, reducing the overall computational complexity. 

In the following, we illustrate the main steps of makeTab by an example. Suppose to need subdivisions 
of the multiset 

M = {a^a, 7/y}. 

3 2 

We compute all different subdivisions of {o^ 3 '} by using all partitions A of the interger 3, that is 

{{a},{a},{a}}, {{a},{o (2) }}, {{« (3) }}- (30) 
The same we do for {7^}, that is 

{{7}, {7», {{7 (2) }}- (31) 

Now, we insert every element of (|31j) in every element of (|30j) one at a time and recursively, as the 
following example shows. Suppose to do the insertion of {{7}, {7}} in {{&}, {«}, {«}}• We first insert {7} 
in every block of {{a}, {a}, {a}}, that is 

{{a}, {a}, {a}} <- 7. 
Then, we insert a second time {7} in the output subdivision, that is 

({{a}, {a}, {a}} ^{7}) -{7}. 
The insertion <— is a kind of iterated inclusion-exclusion rule [T], but with some more constraints: 

i) the insertion of a submultisct of {7^} in a submultisct of {o^ 3 )} must be done only if it does not 

generate a new submultisct equal to a previous one or it has not yet inserted; 

ii) at the end, every submultisct of {7^} is simply appended to every subdivision of {ct^ 3 )}. 

Table 3 gives the results of the double insertion of {{7}, {7}} in every submultiset of {a^}, according to 
rules i) and ii). 
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Subdivision 


Output first <— 


Output second <— 


r r "i r t r 1 1 

{{a},{a},{a}} 


r r iririT 

{{a, 7}, {a}, {a}} 


rr ir t r t l 

{{a, 7}, {a, 7}, {a}} 




{{a}, {a}, {a}, {7}} 


{{a, 7}, {a}, {a}, {7}} 
{{«},{"},{"}, {7}, {7}} 


{{a},{a&}} 


{{a, 7 },{a (2) }} 
{{a},{a< 2 ),7}} 
{{a},{a( 2 )},{ 7 }} 


{{a, 7 },{« (2) ,7}} 
{{a, 7 },{a( 2 )},{ 7 }} 

{{a},{a( 2 ), 7 },{7}} 

/J^A /^,(2)l 

{{a}, {a^ ^,{7)) 


{{« (3) }} 


{{« (3) ,7}} 
{{« (3) },{7}} 


{{« (3) 7,7» 
{{« (3) },{7},{7}} 



Table 3: Insertion of {{7}, {7}} in every submultiset of {a*- 3 - 1 } 

Table 4 gives the results of the insertion of {{7^ 2 - > }} in every submultiset of {a( 3 )}, according to rules i) 
and ii). 



Subdivision 


Output first <— 


{{a},{a},{a}}^{ 7 ( 2 )} 


{{a, 7^}, {a}, {a}} {{a}, {a}, {a}, { 7 < 2 >}} 


{W,{a( 2 )}}^{ 7 ( 2 '} 


{{«,7 (2) },{« (2) }} {W,{« (2) },{7 (2) }} 
{{a},{a( 2 ), 7 (2) }} 


{{a (3) }} ^ {7 (2) } 


{{« (3) ,7 (2) }} {{« (3) },{7 (2) }} 



Table 4: Insertion of {{7 (2) }} in every submultiset of {a^ 3 ^}. 

This strategy is speedier than the iterated full partition of Andrews and Stafford [1] , given that it takes 
into account the multiplicity of all elements of M. The higher this multiplicity is, the more the insertion 
procedure gives efficient results, considering that it involves more than one element of M. At the end, we 
need to compute the number of set partitions in II5 corresponding to the same subdivision. For a given 
subdivision S, this is 



Ai, A2 



Ai 



■ Vj 



ci!c 2 ! ■ • • 

where (Ai, A2, • . • , A&) is the partition of 3 giving the number of times that a appears in every submultiset 
of S, (rji, r)2, . . . ,r]j) is the partition of 2 giving the number of times that 7 appears in every submultiset 
of S and ct is the multiplicity of every submultiset in S. 
As example for {{a, 7}, {a, 7}, {a}} this number is 



3 

1,1,1 



2 
1,1 



1!2! 

When there are monomials involving correlated singleton umbrae in the multiset M, see equivalence 
(f2"5)l . we have further speeded up the procedure. When in the same block of the subdivision, there are 
monomials involving more than one correlated singleton umbra, the evaluation of umbrae indexed by 
this subdivision does not give contribution. Then, if we check the indexes of singleton umbrae before the 
insertion procedure, we can delete the subdivision from the list and reduce the overall computational time. 
For example, if we have {{xiai, X2&2}, {X2&2}}-, the insertion of Xi a i may be done only in the second 
set, because the first one gives a zero contribution in the overall evaluation. 
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6.2 ^/-statistics 

Table 5 shows computational times of three procedures implementing the change of bases from augmented 
symmetric polynomials versus power sums, which is at the bottom of the construction of [/-statistics. The 
three procedures are the function AugToPowerSum given in MathStatica (release 1.0) |19) . the function 
TOP given in SF (version 2.4) [55], and our MAPLE function augToPs, with which we have implemented the 
second part of (|2ip . Comparisons have shown how augToPs performs its task using less computational 
time than all the others. 



[l l 2^3 fc ■ ■ •] 


TOP 


AugToPowerSum 


augToPs 


[l 5 2 3 3 2 ] 


0.78 


0.18 


0.13 


[l 6 2 3 ] 


0.08 


0.01 


0.01 


[2 10 ] 


2.57 


0.03 


0.01 


[1 B 2 7 3 1 ] 


6.15 


1.20 


0.65 


[l 2 2 2 3 2 4 2 ] 


2.75 


0.11 


0.09 



Table 5: Comparison of computational times. 



Unlike our MAPLE algorithm, note that AugToPowerSum and TOP do not work on multiple sets of variables 
so no comparisons can be done. 

In order to compare the results achieved by means of umbral methods with those of [2], we have 
performed symbolic computations involved in unbiased estimators of product of univariate and multivariate 
cumulants. 

The symmetric statistic fc r ,...,t such that E\k r ,... where K r , . . . , Kt are univariate cumu- 

lants, is known as polykay. Being a product of cumulants, the umbral expression of a polykay is simply 

k r _ t ^{ X -a) r ---{x'-a')\ (32) 

with \i ■ ■ ■ iX' being uncorrelated singleton umbrae and a, . . . , a' satisfying a = . . . = a! . If r+ • • • + t < n, 
the right-hand product of (|32|) has the following umbral expression in terms of power sums: 

k r ,..., t = 2^ ivi C*-x) ( n - a )s^ ( 33 ) 

(Ah,,...,r,R) W"x+-+^ neU Vx + ..^ 

where SV is the subdivision of the multiset 

P A+ ... + , = {a(^"+ t '),a 2(r2+ - +t2) ,...}, 

corresponding to the partition 7r 6 n, /A ^ |_„ . In analogy with (|32p . a multivariate polykay is a product 

of multivariate cumulants, that is E[kt 1 ...t r ,...,h...l m ] = Kti— t r • • • K h—l m j where /tt 1 ...j r , K>i-L—i m are 
multivariate cumulants. Products of multivariate cumulants are represented by products of uncorrelated 
multivariate a-cumulant umbrae, that is 

k tl ...u,...,h...l m - (x-m)t ' ' • (xVk, (34) 

where x an d X are uncorrelated and the umbral monomials fi S T and y! G L are such that 

T = {^\...,^},...,L = { fJ ,f \...,»'t ) }- 

If \T\ + ■ ■ ■ + \L\ < n, the right-hand product of (|3"4"1) has the following umbral expression in terms of 
multivariate power sums: 

!■■>., /•.../ £ {X - X) ^--- {X '- X ' )H (x-x\ T (n.p)s T , (35) 
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where S T is the subdivision of the multiset obtained by the disjoint union of T, . . . , L with no uncorrelation 
labels and corresponding to the partition t of the set built with the blocks of {n, . . . ,tt}. The umbral 
formulae here recalled are stated in [5]. 

Table 6 shows computational times obtained by the procedure PolyK of MathStaticaQ, by our MAPLE 
function polyk implementing (|34j) . and by the procedures proposed by Andrews and Stafford [1] (in 
particular, see [I] for multiple sums). Note that ([55]) gives as special case both ()32p and univariate and 
multivariate fc-statistics ([2"3"|) . We remark that MathStatica has no procedure to handle multivariate 
polykays [§ The computational times of Andrews and Stafford's procedures have been obtained by the 
code available at http: //fisher. utstat.toronto.edu/david/SCSI/chap. 3. nb. 



7 

fa,..., 1 


A j Hi re j 

Andrews-Stafford 


MathStatica 


MAPLE 


fes 


0.10 


0.09 


0.05 


fci n 


0.35 


0.25 


0.15 


kn 


1.19 


0.84 


0.42 


ku 


3.93 


2.67 


1.29 


km 


12.74 


8.54 


3.86 


kis 


40.44 


30.32 


12.3 


h,e 


39.84 


0.81 


0.51 


fcg,3 


18.32 


0.84 


0.46 


^9,6 


676.14 


5.02 


3.13 


^9,9 


> 1.5hh 


29.10 


23.19 


^3,3^2,2 


4.67 


n.c. 


1.71 


&3,3&3,3 


32.16 


n.c. 


15.87 


k.2, 1,1*2,1,1 


1.031 


n.c. 


0.52 



Table 6: Comparison of computational times. 
The acronym "n.c." stands for not calculable. 



Remark that the output expressions of Andrews and Stafford's code are unpractical. These are very 
different from the output expressions of PolyK of MathStatica and polyk in MAPLE. For example, the 
output of Andrews and Stafford's code for k$ is 

whereas the output of PolyK of MathStatica and polyk in MAPLE is 

n 2 S3 — 3nS\S2 + 2Sf 
n(n- l)(n-2) 

In ki2, the expression of Andrews and Stafford's code consists of 602 terms compared with 77 terms of 
the expressions obtained by PolyK of MathStatica and polyk in MAPLE. In order to recover the same 
output of PolyK of MathStatica and polyk in MAPLE in (|36p . we must group the terms in parenthesis 
over a common denominator, deleting equal factors in the results. This operation increases the overall 
computational time. For example the computational time of fcio grows from 0.35 to 2.693, the one of fci2 

1 Of course, if single fc-statistics are enough to be computed, the function k-stat of MathStatica is suitable and faster. 
2 In the forthcoming MathStatica, release 2, the procedure to handle multivariate polykays is now availbale (C. Rose, 
private communication). 
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grows from 1.191 to 14.56. For fc-statistics of order greater than 14, an error occurs since the recursion 
exceeds a depth of 256. 

6.3 Product of augmented symmetric functions 

The MAPLE routine Pam implements equivalences such as (|2l?|) . Pam calls the routine makeTab. Considering 
that there are monomials involving singleton umbrae in the multiset M, the efficiency of makeTab improves, 
as we have mentioned at the end of Subsection 6.1. In Table 7, we compare some computational times of 
Pam with those of the routine SIP, written in MATHEMATICA language by Vrbik [24], and exclusively devoted 
to products of augmented symmetric functions. 



[P2^'3 fc • • •] 


SIP 


MAPLE 


[5 3 9 10][12345] 


0.7 


0.1 


[5 3 8910][12345] 


5.6 


0.4 


[678910][12345] 


2.2 


0.1 


[6 78 9 10] [12] [3 4 5] 


3.1 


0.4 


[6 7] [8 9 10] [12] [3 4 5] 


4.7 


1.3 


[5678910][12345] 


16.7 


0.3 


[5678910][123456] 


348.7 


1.5 


[678910][6 7][345][1 2] 


125.6 


16.4 



Table 7: Comparison of computational times. 

Note that the computational time of SIP depends heavily on the number of variables involved in the 
brackets, because it resorts set permutations, whose computation cost is factorial in its cardinality. 

7 Concluding remarks 

This paper focuses attention on a symbolic calculation of products of statistics related to cumulants or 
moments. Undoubtedly, an enjoyable challenge is to find efficient procedures to deal with the necessarily 
huge amount of algebraic and symbolic computations involved in such a kind of calculations. The methods 
we propose result more efficient compared with those available in the literature. Note that high order 
statistics have a variety of applications. Recently, Rao [T7] have shown applications of high order cumulants 
in statistical inference and time series. Indeed, there are different areas, such as astronomy (see |16) and 
references therein), astrophysics |10j and biophysics [15j . where one computes high order fc-statistics in 
order to recognize a gaussian population or characterizes asymptotic behavior of high order fc-statistics if 
the population is gaussian. Indeed, fc-statistics are independent from the sample mean if and only if the 
population is gaussian [13| and in such a case fc-statistics of order greater than 2 should be nearly to zero. 
For such applications, increasing speed and efficiency is a significant investment. 

As we have shown, the codes of Andrews and Stafford arc quite inefficient for the problems posed here. 
This paper has pointed out the role played by the notion of subdivision in speeding up the calculations 
resulting by multiplying sums of random variables and the role played by the umbra \ m selecting the 
involved variables. The symbolic algorithm we propose, in order to evaluating the mean of product of 
augmented polynomials in random variables, relics on this innovative strategy. 

In closing, we would like to emphasize that classical umbral calculus not only decreases the com- 
putational time, but offers a theory to prove more general results. Recently, L-moments and trimmed 
L- moments have been noticed as appealing alternatives to conventional moments, see [12] and [§]. We 
believe that the handling of these number sequences would benefit by an umbral approach. 
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8 Appendix 

In the following, we present the MAPLE code of the procedure giving subdivisions of a multiset. In order 
to have the results of Table 3 and 4, the calling syntax is makeTab(3,2). 



nRep := proc(u) 

mul(x[2] ! ,x=convert (u, multiset) ) ; end: 
# # 

URv := proc(u,v) 
local U,ou, i ,ptr_i , vl ; 



ou:=NULL; U: = [] ; vl : =indets (v) ; 

for ptr_i from nops(u) by -1 to 2 do 

if has(u[ptr_i] ,v) then break; fi; od; 
for i from ptr_i to nops(u) do 

if not (u[i]=ou or has (u[i] , vl) ) then ou:=u[i] ; 

U: = [op(U), [op(u[l. . (i-l)]),u[i]*v,op(u[(i+l) . .-1])]] ; 

f i; 

od; op(U) , [op(u) , v] ; end: 

# # 

URV := procO 
local U,V,i; 

U: = [args[l,l]] ; V:=args [2, 1] ; 
for i from 1 to nops(V) do 

U := [ seq( URv(u,V[i]), u=U ) ]; 
od; seq( [x, args [1 , 2] *args [2 , 2] /nRep(x)] ,x=U) ; end: 
# # 

URmV := procO 
local U,i,nbin; 
if nargs=l then 

U: =args ; 
else U:=URV( args[l], args [2]); 
for i from 3 to nargs do 

U:=seq( URV( u, args [i] ) , u=[U] ); 
od; fi; 
U; end: 



# # 

comb := proc(V,ptr , Y) 

if ptr=nops(V)+l then return(Y); fi; 

seq( comb(V, ptr+1, [ op(Y) , L ] ), L=V[ptr]); end: 
# # 

makeTab := procO 



local U; 

U: = [seq( [seq( [ [seq(P I I i~z,z=y)] , 

combinat ['multinomial '] (args [i] , seq(r ,r=y) )] , 
y=combinat ['partition'] (args [i] ) )] , 
i=l . .nargs)] ; 

if nops(U)=l then [seq( [x [1] ,x [2] /nRep(x [1] )] ,x=op(U) )] ; 

else [seq(URmV(op(x)) ,x= [comb(U, 1 , [])])] ; 

f i ; end : 
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